Unfolding times for proteins in a force clamp 
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The escape process from the native vaUey for proteins subjected to a constant stretching force is 
examined using a model for a /3-barrel. For a wide range of forces, the unfolding dynamics can be 
treated as one-dimensional diffusion, parametrized in terms of the end-to-end distance. In particular, 
the escape times can be evaluated as first passage times for a Brownian particle moving on the protein 
free-energy landscape, using the Smoluchowski equation. At strong forces, the unfolding process can 
be viewed as a diffusive drift away from the native state, while at weak forces thermal activation 
is the relevant mechanism. An escape-time analysis within this approach reveals a crossover from 
an exponential to an inverse Gaussian escape-time distribution upon passing from weak to strong 
forces. Moreover, a single expression valid at weak and strong forces can be devised both for the 
average unfolding time as well as for the corresponding variance. The analysis offers a possible 
explanation of recent experimental findings for ddFLN4 and ubiquitin. 



Single-molecule pulling experiments have become an important and widely used tool for examining mechanical prop- 
erties of proteins [Ij. These experiments have stimulated a renewed interest in the escape processes from metastable 
potential wells in the presence of a biasing force ^ . Traditionally, the dependence of the escape rate k on the stretching 
force F has often been modeled using the phenomenological Bell formula k{F) = ko qP^^^ [3|, where is the distance 
from the native to the transition state and assumed constant (/? = l/k^T, with k^ being Boltzmann's constant and 
T the temperature). The zero-force rate fco satisfies fcg oc e~^^'-^°, where AGq is the escape free-energy barrier at 
zero force. There are, however, uncertainties about how to extract the zero- force properties kg, and AGq from 
observed escape rates at non-zero force. One problem is the unknown constant of proportionality in the expression 
for fco- Another difficulty is that the distance a;u between the native free-energy minimum and the unfolding barrier, 
which is assumed constant in the Bell formula, generally depends on the applied force. 

To address these problems, several generalizations of the Bell formula have recently been proposed [1, Most 
of the extensions are based on the same underlying picture as for the Bell formula; the protein is viewed as a Brownian 
particle moving in a tilted one-dimensional potential, G{x) = Go{x) — Fx, where Gq{x) is the equilibrium free-energy 
profile. Using different approximations and parametrizations of Ga{x), key properties of the escape process have been 
analyzed, like the mean and variance of the rupture force at constant velocity pulling 0, @|- K was further shown Q 
that the approach of Dudko, Hummer and Szabo (DHS) [4] is able to describe experimentally observed deviations 
from the Bell formula for the protein ddFLN4 f2]. 

These extensions based on Kramers theory ^ assume that the escape barrier is high compared to fceT, leading to 
single-exponential kinetics. Very recently. Yew et al. analyzed deviations from single-exponential kinetics in unfolding 
simulations based on a Cq model [13] ■ By including the next-to- leading term in an eigenfunction expansion, they 
obtained an improved description of the unfolding dynamics at strong force. However, a comprehensive picture 
describing k{F) and the full escape-time distribution at both weak and strong force is still missing. A key parameter 
when describing the force dependence is the critical force Fc, at which the escape barrier disappears. In the DHS 
approach [1,13], one has Fc — AGq/i^Xu, where v is a model parameter {i^ — 1 corresponds to the Bell formula). The 
above-mentioned ddFLN4 analysis \§\ (with = 1/2 or 2/3) suggests that Fc ^ 80-1 10 pN for this protein. For the 
titin module 127, on the other hand, Fc appears to be significantly larger {AGq/xu ~ 640 pN [H]). Due to different 
Fc, when analyzing experimental data, the strong-force regime F > Fc may or may not be relevant, depending on the 
protein. 

In this Letter we investigate the response of a model protein to a wide range of constant pulling forces. We show 
that, once the free-energy landscape is known with sufficient accuracy, the usual Smoluchowski equation [§] in one 
dimension is sufficient to obtain a good estimate of the average escape time from the native valley and the associated 
variance. Two force regimes, separated by the critical force Fc, are observed. For F < Fc, unfolding occurs through a 
thermally activated escape process. For F > Fc, the unfolding dynamics can instead be interpreted as pure diffusion 
with an external bias. The transition from the weak- to the strong-force regime is accompanied by a drastic change 
in the shape of the escape-time distribution, from exponential to inverse Gaussian. The applicability of this approach 
to real proteins, at forces studied experimentally, is addressed using recently reported data for ddFLN4 [3, 4] and 
ubiquitin [il,[i|]. 



The protein model we consider is the 3D ofF-lattice BPN model [3, [3, [3 1 where each residue is represented by 
a single point and is of one of the following three types: hydrophobic (B), polar (P) or neutral (N). We study a 
46-residue sequence which is known to form a four-stranded /3-barrel in its native state. The folding [3, [H, [H, [l3| 
and mechanical unfolding [3, of this sequence have been extensively studied. We analyze via Langevin dynamics 
the response of this model protein to external forces acting on the chain ends in proximity of its folding temperature, 
namely at T = 0.3 Parameters values are as in Ref. [l^ and all model quantities are dimensionless; for a 

comparison with physical units, see Ref. (Tg! ]. 




FIG. 1; (Color online) Free energy Go{() at zero force for the BPN protein, calculated as a function of the end-to-end distance 
The positions of the native state, (o « 2.0, and the saddle, (s ~ 5.25, are indicated. The inset shows the escape barrier AG 
versus F. The vertical (blue) and horizontal (green) lines indicate Fc and the zero-force barrier, AGq. 

A typical unfolding trajectory begins with a waiting phase, where the end-to-end distance ^ stays close to its 
native value. This phase is followed by a sudden increase in ^. A fundamental question is whether the escape from 
the native valley can be effectively described as one-dimensional diffusion, parametrized in terms of C- Based on 
this assumption the unfolding process is commonly described as the motion of a point-like Brownian particle in the 
potential G(C) — Go(C) — FC,, where Go(C) is the equilibrium free-energy profile. The average first passage time t{x) 
at a threshold Cs for a particle with initial position x G [Co? Cs] can be obtained by solving the Smoluchowski equation. 
One finds that Q 

t{x) = PMj f^' dy e'^^^y') T dz e'^^^^^ (1) 

Jx J Co 

where M is the particle mass and 7 the damping constant. The boundaries at ^0 and Cs are reflecting and absorbing, 
respectively. When using Eq. ([T]) to calculate the escape time from the native valley, Co is the native C and is that 
of the saddle, or barrier, to be crossed. The escape time is obtained as ts = t(Co)- In our simulations, escape times 
are measured using a threshold slightly larger than , to avoid saddle recrossing 9] . 

We begin by testing the escape-time prediction ts directly against simulation results for the BPN protein, without 
making any further assumption on the form of G{(). For this purpose, we determine G(C) numerically, using methods 
described in Ref. [l^. Fig.[l]shows the calculated free-energy profile at zero force, Go{C), which exhibits a pronounced 
native minimum at ~ 2.0 and a barrier at Cs ~ 5.25. The height of the barrier is AGq — Go(Cs) — Go(Co) ~ 5.62. 
The application of a stretching force F tilts the free-energy landscape to G(C) = Go(C) — -^C and reduces the barrier 
height AG. As shown in the inset of Fig. [1] AG decreases almost linearly with F. The barrier finally disappears at 
Fc w 1.83. 

Knowing G(C), the escape-time prediction ts can be obtained by numerically evaluating the double integral in 
Eq. 11]). In Fig. m we compare ts with simulated escape times. The agreement is very good for strong forces (F > 3) 
as well as at weak forces {F < 1.2). Due to computational limitations, it was impossible to investigate forces < 0.6. 
The regime in which the simulated escape times are most difficult to reproduce is around the critical force Fc, where 
there is no clear free-energy gradient either towards or away from the native state. In this regime, the details of 
the free-energy profile matter. It is remarkable, however, that this simple picture, without employing any fitting 
parameter, is able to describe the behavior at both strong and weak forces, despite escape-time differences of almost 
six orders of magnitude. 

This analysis, based on the full profile G(C), addresses in a direct manner the question of whether or not the system 
can be described in terms of one-dimensional diffusion. In unfolding experiments, G(C) is unknown, and the challenge 




FIG. 2; (Color online) Average escape time against force for the BPN protein. Filled (red) circles are simulation results and 
the (black) curve is the prediction ts obtained from Eq. ((TJ, with 7 — 0.05 and M — 46. The vertical (magenta) line indicates 
Fc- The dotted (green) line is the estimate tl in Eq. ([2]), with M and 7 as above and a = 3.25. The inset shows the variance, 
V. Filled (red) circles are simulation results, whereas the (black) curve and the dotted (green) line represent the estimates Vs 
and Vl, respectively. 



is to extract the main features of the free-energy landscape from measured escape times. This task is greatly facilitated 
if the free energy can be linearly approximated in the interval [Co, Cs], as G(C) = {Fc — F){( — (q) (up to an additive 
constant). With this approximation, the integrals in Eq. ([T]) can be evaluated analytically. The resulting expression, 
for the average escape time of a diffusive particle in one dimension in the presence of a bias {F in the present context), 

is a 
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where a = Cs ^ Co is the distance between the reflecting and absorbing boundaries. Unlike the result reported in 
Ref. Eq. ^ includes the effect of a zero- force barrier, represented by the term Fca. The singular terms at F = Fc 
in Eq. ^ cancel out, as they should. 

The assumption that G(C) is linear between the native state and the saddle is quite well satisfied for the BPN 
protein (see Fig. [1]) . Actually, the escape times obtained using this approximation, tl , essentially coincide with the 
estimated ts obtained using the full G(C), as can be seen from Fig. [2] Note that Eq. ([2]), like Eq. ([1]), has no parameter 
that needs to be fitted, because we can use the value of Fc previously determined. While Eq. ^ well describes the 
escape time down to the lowest forces that could be studied, one should still be cautious in using this expression to 
extrapolate to zero force, because a "turnover" to a force-independent process is likely to occur at weak force [2]| . 
The extent of this weak-force regime might be non- negligible if the temperature is high [2l[ . 

The variance of the escape time is in the Smoluchowski approach given by Vs ~ T2,s where the second moment 
T2.S reads [22[ 
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Like the mean, the variance can be obtained analytically if G(C) depends linearly on C. This estimate of the variance, 
Vl, can be found in Eq. (SI) [1^. The inset of Fig. [5] shows our simulation results for the variance of the escape time 
for the BPN protein, along with the estimates Vs and Vl- For F < Fc, Vs and Vl are almost identical; while for 
F > Fc, Vl is slightly larger than both Vs and the simulation results, although the corresponding three average times 
are very similar in this regime. Overall, both Vs and Vl agree well with the simulation results. 

It is informative to go beyond the first and second moments and also study the full probability distribution of the 
escape time. For F < Fc, we find that the escape-time distribution of the BPN protein to a very good approximation 
is exponential, P{t) = r~^e~*/'^, with t being the mean (see Fig. [3^). This observation confirms that at weak forces, 
where a free-energy barrier is still present, the main escape mechanism is thermal activation. At F ~ Fc, the escape 
process changes in character, from a thermally activated process to a diffusive process driven by an external bias 
(force). In the latter regime, it is known that first-passage times follow a so-called inverse Gaussian distribution [24j . 
This distribution is given by 



where t is the mean and T ^ V/t, V being the variance. This expression indeed provides a very good description of 
our simulation results at strong forces, as illustrated in Fig. It should be noticed that this comparison does not 
involve any parameter fitting, because r and V are determined directly from the simulations. 
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FIG. 3: (Color online) Escape-time distribution P{t) for the BPN protein at two different forces. Large (black) dots are 
simulation results, a,) F — 0.6 (< Fc). The (red) curve is an exponential (r = 7.02 ■ fO^). h) F = 2.2 (> Fc). The (red) curve 
is an inverse Gaussian (r = 7f .f , V = f .35 ■ fO"^), whereas small (blue) dots represent a log-normal fit to data. 

Previous studies have used a log-normal distribution, rather than the inverse Gaussian, to describe the escape-time 
distribution at strong forces [2^,|2a|. While the log- normal distribution is similar to the inverse Gaussian (see Fig.[3l3), 
there is no theoretical background to justify its use in the present context. The inverse Gaussian distribution is, by 
contrast, known to arise from biased Brownian motion [23 |. which provides a simple physical picture of the unfolding 
dynamics at strong forces. 

Having seen that our approach provides a good description of the unfolding dynamics of the BPN protein, we now 
turn to two real proteins, ddFLN4 and ubiquitin. Two results of the above analysis are particularly useful when 
comparing with experimental data. The first is Eq. ^ , which provides an approximate closed-form expression for the 
average escape time t{F) at both weak and strong forces. The second result is that the onset of the non-exponential 
strong-force behavior of t{F) is accompanied by a change of shape of the escape-time distribution, from exponential 
to inverse Gaussian. 
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FIG. 4: (Color online) Average unfolding time versus force for ddFLN4. The symbols (explained in detail in Ref. ^3|) represent 
experimental data, originally obtained at constant velocity [j and transformed to constant-force conditions in Ref. 0]. The 
(black) curve is a fit of Eq. ^ (M7 = 0.012 pN • s/nm, a = 1.1 nm and Fc = 60 pN). The inset shows ubiquitin data from AFM 
force-clamp experiments reported in Ref. [ia |. 

Experimental unfolding times for ddFLN4 show, as mentioned earlier, clear deviations from the Bell formula fs', '3]. 
It has been demonstrated [3] that the DHS approach [4] describes the data well. In Fig.[4l we show a fit of our Eq. ^ to 
the same data. The fit is good, and the fitted values a = 1.1 nm (corresponding to a^u) and AGq = FcU = 9.50 kcal/mol 
are consistent with the results of Ref. Q. Unlike the DHS approach, ours does not assume the escape barrier to be 
high. For ddFLN4, our fit to the t{F) data indicates that the barrier disappears already at F^ ~ 60 pN. It would be 



very interesting to see whether the escape-time distribution is inverse Gaussian at, say, 100 pN, but this distribution 
has not been evaluated, as far as we know. 

For ubiquitin, the escape-time distribution has been measured experimentally at llOpN [13] ■ The data were found 
to be well described by a log-normal distribution , which is very similar to the inverse Gaussian one found above 
at strong forces. Our approach thus offers an explanation of the shape of the observed distribution. This explanation 
requires that Fc < 110 pN. Very recent experimental t{F) data for ubiquitin [l3| show signs of deviations from the 
Bell formula (see inset of Fig. |4|). However, it was found that the data could not discriminate between the Bell and 
DHS formulas [l^. Neither are the data sufficient to permit a stable fit of Eq. which would have given us an 
independent estimate of Fc. The assumption that Fc < llOpN seems, however, fully consistent with the experimental 
t{F) data. 

In this Letter we have shown for a model protein that the unfolding process from the native valley under force- 
clamp conditions can be modeled as a Brownian motion in a tilted one-dimensional free-energy landscape. Moreover, 
it turned out that this description could be further simplified with a surprisingly small loss of accuracy, by adopting a 
linear approximation for the free energy. This analysis links deviations from the Bell formula for k(F) for F > Fc to 
an altered shape of the escape-time distribution, from exponential to inverse Gaussian. Comparison with experiments 
indicates that the strong-force regime might set in at relatively weak force {Fc < 100 pN) for both ddFLN4 and 
ubiquitin. 
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Supplementary Information 

Variance of the unfolding time in the linear approximation 

The variance of the unfolding time as obtained in the Smoluchowski approach is given in the Letter, for a general free 
energy G{Q = G'o(C) — -^C- The integrals in this expression can be evaluated analytically if the free energy is assumed 
to be linear in the interval [(^q, (s], G{C) — (Fc ~ F){( — (q). At external force F, the variance is then given by 

V, = ''''fjplf (l + 2e-^(^--^^^) + ^-ffi^ (^e-m^-^^)^ + 4e-^(---^)'^ - 5) (SI) 

where the symbols are as in the Letter. This result extends the one reported in equation (2.2.31) of Ref. [1], for a 
particle diffusing in one dimension in the presence of a bias {F in our context), to the case where a barrier (represented 
by the term Fcu) is present also at zero bias. 



Strong force limit 

The expressions for the average unfolding time and the associated variance, in the linear approximation, can be 
simplified for strong forces [1], 

Using these expressions, it is, in principle, possible to extract the critical force (Fc) and the distance to the transition 
state (a), and thereby also the zero- force barrier (AGq — Fc x a), from escape-time measurements at strong forces. 

Description of the experimental data for ddFLN4 

Fig. 4 in the Letter shows unfolding times at different forces for the ddFLN4 protein. The data have been extracted 
from Fig. 2b in Ref. The results in Ref. [§| were based on constant- velocity AFM pulling experiments by Ref. [3]. 
Rupture-force histograms obtained by Ref. [j] were transformed in Ref. Q into force-dependent unfolding times 
measurable at constant force. The symbols in Fig. 4 are as in Ref. and correspond to different pulling velocities 
in the original experiments: v — 200nm/s (blue squares), v — 400nm/s (green triangles); v ~ 2,000nm/s (yellow 
diamonds), and v = 4,000nm/s (red stars). 
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